Let $y^\epsilon(\cdot)$ denote a random process whose bandwidth, loosely speaking, goes to $\infty$ as $\epsilon \to 0$. Consider the family of differential equations $\dot x^\epsilon = g(x^\epsilon, y^\epsilon) + f(x^\epsilon, y^\epsilon)/\alpha(\epsilon)$, where $\alpha(\epsilon) \to 0$ as $\epsilon \to 0$. The question of interest is: does the sequence $\{x^\epsilon(\cdot)\}$ converge in some sense and, if so, which, if any, ordinary or Ito differential equation does the limit satisfy? Normally, the limit is taken in the sense of weak convergence. The problem is of great practical importance, for such questions arise in many fields. Often the limiting equation is nice and can be treated much more easily than can the $x^\epsilon(\cdot)$. In any case, in practice approximations to properties of the $x^\epsilon(\cdot)$ are usually sought in terms of $\epsilon$ and some limit. To illustrate these points, as well as a related stability problem, we give a practical example which arises in the theory of adaptive arrays of antennas.

The topic of convergence has seen much work, starting with the fundamental papers of Wong and Zakai, and followed by others, including Khazminskii, Papanicolaou and Kohler, etc. From a non-probabilistic point of view, it has been dealt with by McShane and Sussmann. In this paper, we discuss a rather general and efficient method of getting the correct limits. The idea exploits some general semigroup approximation results of Kurtz, and often not only gets better results than those obtained by preceding methods, but is also easier to use.
1.  Introduction


Let $y^\epsilon(\cdot)$ denote a stationary random process whose "bandwidth" goes to $\infty$ as $\epsilon \to 0$, and define the $R^r$-valued process $x^\epsilon(\cdot)$ by the O.D.E.

(1.1)  $\dot x^\epsilon = g(x^\epsilon, y^\epsilon) + f(x^\epsilon, y^\epsilon)/\alpha(\epsilon)$,  $x^\epsilon_0 = x(0)$ given,

where $\alpha(\epsilon) \to 0$ as $\epsilon \to 0$, and $Ef(x, y^\epsilon_t) = 0$ for each $x$. In this paper, we address the question: as $\epsilon \to 0$, what is the limit of $\{x^\epsilon(\cdot)\}$; in particular, does it satisfy an ordinary or Ito stochastic differential equation, and if so, what is that equation? The question arises frequently in applications in many areas. Often $y^\epsilon(\cdot)$ is a rather arbitrary process and yet the limit is a nice Markov process satisfying, say, an Ito equation. Then many functionals of $x^\epsilon(\cdot)$ can be approximated by functionals of the limit and the parameter $\epsilon$, for small $\epsilon$. In applications, this is often done, either explicitly or implicitly. In Sections 2 and 6, one particularly important application will be discussed.

The problem has been around for some time and is a crucial aspect of the problem of modelling the processes which arise in practice by mathematically tractable processes. Perhaps the first mathematical treatment was given by Wong and Zakai [1], [2], who dealt with equations of the form

(1.2)  $\dot x^\epsilon = g(x^\epsilon) + f(x^\epsilon)y^\epsilon$,

where $y^\epsilon(\cdot)$ was (more or less) the derivative of a polygonal approximation $Y^\epsilon(\cdot)$ to a Wiener process, and $Y^\epsilon(\cdot)$ converged to that process as $\epsilon$ went to zero. Much subsequent work was done on the following form. Let $\alpha(\epsilon) = \epsilon$, and suppose that $y(\cdot)$ is a stationary bounded process and $\rho(\cdot)$ a measurable function which satisfy the strong mixing condition (1.3).

(1.3a)  $\int_0^\infty \rho^{1/2}(s)\,ds < \infty$

(1.3b)  $|P\{B \mid A\} - P\{B\}| \le \rho(\tau)$

for each $t, \tau$ and each $B \in \mathcal{B}(y_s, s \ge t + \tau)$ and $A \in \mathcal{B}(y_s, s \le t)$. Let $y^\epsilon(t) = y(t/\epsilon^2)$. Motivation for this scaling is given in the next subsection. (We write the values of a process $y(\cdot)$ as either $y(t)$ or $y_t$, depending on notational convenience.)


Under (1.3) and other conditions on $g$ and $f$, (1.1) was dealt with by Khazminskii [3], Papanicolaou [5], Papanicolaou and Kohler [4], Papanicolaou and Blankenship [6] and Kushner [7]. The last reference obtained perhaps the most general results (for the time invariant case) and allowed cases where $y^\epsilon(\cdot)$ could contain (approximations to) impulsive jumps, and also where $y(\cdot)$ is unbounded but the form $f(x,y) = f(x)y$ was used.

Let $\hat C_0^i$ denote the space of real-valued functions on $R^r$ which go to zero as $|x| \to \infty$, together with their first $i$ mixed partial derivatives, and let $\hat C^i$ denote the subset with compact support. Let subscript $x$ denote gradient, and define the operator $A$ on $\hat C^2$ by

(1.4)  $Ak(x) = Eg'(x,y_s)k_x(x) + \int_0^\infty E f'(x,y_s)\,(f'(x,y_{s+\tau})k_x(x))_x\,d\tau$

       $= \sum_i b_i(x)\,\frac{\partial k(x)}{\partial x_i} + \frac{1}{2}\sum_{i,j} a_{ij}(x)\,\frac{\partial^2 k(x)}{\partial x_i\,\partial x_j}$.


In references [3]-[7], it was proved (under various conditions on $f$, $g$, $y(\cdot)$) that $x^\epsilon(\cdot)$ converged weakly to a diffusion process $x(\cdot)$ whose generator on $\hat C^2$ functions is the operator $A$ of (1.4).
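
As a worked illustration of (1.4), the following derivation specializes it to the scalar case. It is only a sketch under added assumptions that are not part of the general statement: $x$ and $y$ are scalar, $f(x,y) = f(x)y$, $y(\cdot)$ has zero mean with integrable correlation function $R(\tau) = Ey_s y_{s+\tau}$, and the shorthand $\bar g(x) = Eg(x,y_s)$ and $\bar R = \int_0^\infty R(\tau)\,d\tau$ is introduced here for convenience.

```latex
\begin{align*}
Ak(x) &= \bar g(x)\,k_x(x)
        + \int_0^\infty E\bigl[f(x)\,y_s\,\bigl(f(x)\,y_{s+\tau}\,k_x(x)\bigr)_x\bigr]\,d\tau \\
      &= \bar g(x)\,k_x(x) + \bar R\, f(x)\,\bigl(f(x)\,k_x(x)\bigr)_x \\
      &= \bigl[\bar g(x) + \bar R\, f(x) f_x(x)\bigr]\,k_x(x) + \bar R\, f^2(x)\,k_{xx}(x),
\end{align*}
% Matching with (1.4):  b(x) = \bar g(x) + \bar R f(x) f_x(x),   a(x) = 2 \bar R f^2(x).
```

Under these assumptions the drift picks up the correction term $\bar R f f_x$ in addition to the averaged $g$, and the diffusion coefficient is proportional to the spectral density of $y(\cdot)$ at zero frequency, since $2\bar R = \int_{-\infty}^{\infty} R(\tau)\,d\tau$.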

References [4] and [6] contain a wealth of ideas on the approximation and related problems. The methods used in [7] are based on general semigroup approximation methods of Kurtz [8]. They have a number of advantages over previous methods, being somewhat easier to use and giving better results in many cases. The method will be described and used in Section 3.

Most past work has dealt with showing that $x(\cdot)$ is a good approximation to $x^\epsilon(\cdot)$ in some sense. Only recently (see, e.g., [13]) has the question of dealing with the control and stability properties of $x^\epsilon(\cdot)$ in terms of those of $x(\cdot)$ been considered. Reference [14] deals with the reversed problem: finding $x^\epsilon(\cdot)$ which are easier to work with than $x(\cdot)$.


Discussion of properties of $y^\epsilon(\cdot)$ as $\epsilon \to 0$. As $\epsilon \to 0$ in (1.1) the process $f(x^\epsilon, y^\epsilon)/\alpha(\epsilon)$ is "increasingly compressed", hence (loosely speaking) the bandwidth goes to $\infty$. If $f(x,y)$ were not divided by $\alpha(\epsilon)$, then the average energy in the process $f(x^\epsilon, y^\epsilon)$ (over any finite interval) would tend to zero as $\epsilon \to 0$ and the $f$ term would play no role in the limit. To see the rough idea most simply, let $y^\epsilon(\cdot)$ be scalar valued, let $f(x,y) = f(x)y$, let $R(\cdot)$ and $S(\cdot)$ denote the correlation function and spectral density of a stationary process $y(\cdot)$, consider the special case where $y^\epsilon_t = y(t/\epsilon^2)$ and let $R^\epsilon(\cdot)$ and $S^\epsilon(\cdot)$ denote the correlation function and spectral density of $y^\epsilon(\cdot)/\alpha(\epsilon)$. Then $R^\epsilon(t) = R(t/\epsilon^2)/\alpha^2(\epsilon)$ and

$S^\epsilon(w) = \int_{-\infty}^{\infty} e^{iwt}R^\epsilon(t)\,dt = \int_{-\infty}^{\infty} e^{iwt}R(t/\epsilon^2)\,dt/\alpha^2(\epsilon) = \epsilon^2 S(\epsilon^2 w)/\alpha^2(\epsilon)$.

Unless $\alpha(\epsilon) = \epsilon$, the energy per unit bandwidth either blows up ($\epsilon/\alpha(\epsilon) \to \infty$) or goes to zero ($\epsilon/\alpha(\epsilon) \to 0$). When $\alpha(\epsilon) = \epsilon$, $R^\epsilon(0) = R(0)/\epsilon^2 \to \infty$. If the "magnitude" of $y^\epsilon(t)/\epsilon$ did not go to $\infty$ as $\epsilon \to 0$, then the energy per unit bandwidth would go to zero. So, in order to get a constant energy per unit bandwidth as $\epsilon \to 0$, we need both a time compression ($t/\epsilon^2$ scale) and an amplitude magnification ($\alpha(\epsilon) = \epsilon$). Use of this remark will be made in the next section.
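
The scaling relation $S^\epsilon(w) = \epsilon^2 S(\epsilon^2 w)/\alpha^2(\epsilon)$ can be checked numerically. The sketch below assumes, purely for illustration, the correlation function $R(t) = e^{-|t|}$ (so that $S(w) = 2/(1+w^2)$); this choice and all function names are hypothetical and do not come from the paper.

```python
import math

def spectral_density(w):
    """S(w) for the assumed correlation function R(t) = exp(-|t|)."""
    return 2.0 / (1.0 + w * w)

def scaled_spectral_density(w, eps, alpha):
    """Closed form eps^2 * S(eps^2 * w) / alpha^2 for y(t/eps^2)/alpha."""
    return eps**2 * spectral_density(eps**2 * w) / alpha**2

def numeric_scaled_density(w, eps, alpha, n=200000, span=40.0):
    """Midpoint-rule value of the Fourier integral of R(t/eps^2)/alpha^2.

    R is even, so only the cosine part of exp(iwt) contributes. The
    integrand decays on the time scale eps^2, hence the window
    |t| <= span * eps^2.
    """
    half_width = span * eps**2
    h = 2.0 * half_width / n
    total = 0.0
    for i in range(n):
        t = -half_width + (i + 0.5) * h
        total += math.cos(w * t) * math.exp(-abs(t) / eps**2)
    return total * h / alpha**2
```

With $\alpha(\epsilon) = \epsilon$ the value at $w = 0$ stays equal to $S(0)$ for every $\epsilon$, whereas $\alpha(\epsilon) = \sqrt{\epsilon}$ drives it to zero and $\alpha(\epsilon) = \epsilon^2$ makes it blow up, in line with the discussion above.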

In Section 5, we illustrate the technique of [7] on an important class of problems not explicitly treated previously. For each $\epsilon$, let $\{s^\epsilon_i, i \ge 0\}$ denote a stationary process. Define $s^\epsilon(\cdot)$ as the function which is equal to $s^\epsilon_i$ on the interval $[i, i+1)$, set $\xi^\epsilon(t) = s^\epsilon(t/\epsilon)$, and let $s^\epsilon_i$ be "small"; i.e., $Ef(x, s^\epsilon_i) = 0$, $\mathrm{var}\, f(x, s^\epsilon_i) \sim \epsilon$, and define $x^\epsilon(\cdot)$ by

(1.5)  $\dot x^\epsilon = g(x^\epsilon, \xi^\epsilon) + f(x^\epsilon, \xi^\epsilon)/\epsilon$,  $x^\epsilon(0) = x_0$.

The exact forms of the conditions to be used are stated in Section 5. The form (1.5) is chosen partly to illustrate the method. That $\xi^\epsilon(\cdot)$ is piecewise constant makes the calculations a little easier, but is not a particularly crucial assumption. We will treat the case where $f(\cdot,\cdot)$ is linear in its second argument: $f(x,s) = f(x)s$.

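Equation (1.5) is also important from the point of view of applications. Consider the scalar valued discrete parameter sequence $X^\epsilon_{n+1} = X^\epsilon_n + h(X^\epsilon_n, s^\epsilon_n)$, where $Eh(x, s^\epsilon_i) = \epsilon p(x)$ and $\mathrm{var}\, h(x, s^\epsilon_i) = \sigma^2(x)\epsilon$. Then, setting $q(x, s^\epsilon_i) = h(x, s^\epsilon_i) - Eh(x, s^\epsilon_i)$ yields the discrete parameter version of (1.5):

(1.6)  $X^\epsilon_{n+1} = X^\epsilon_n + \epsilon p(X^\epsilon_n) + q(X^\epsilon_n, s^\epsilon_n)$.

Let $\bar x^\epsilon(\cdot)$ denote a piecewise linear interpolation of $\{X^\epsilon_n\}$ which is linear in each $(\epsilon n, \epsilon n + \epsilon)$ and equals $X^\epsilon_n$ at $\epsilon n$. Then the slope of $\bar x^\epsilon(\cdot)$ is $p(X^\epsilon_n) + q(X^\epsilon_n, s^\epsilon(\epsilon n/\epsilon))/\epsilon$ in $[\epsilon n, \epsilon n + \epsilon)$. Thus, (1.5) is a continuous parameter version of $\{X^\epsilon_n\}$. The limits of $x^\epsilon(\cdot)$ and of $\{\bar x^\epsilon(\cdot)\}$ are not necessarily the same, although in many cases we can find $g$ and $f$ such that $x^\epsilon(n\epsilon) = X^\epsilon_n$ for all $\epsilon$, $n$.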
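
The recursion (1.6) and its piecewise linear interpolation can be sketched in a few lines. The functions below are a minimal illustration; the names `simulate_chain`, `interpolate` and `slope` are hypothetical, and any concrete choice of `p`, `q` and the noise sequence (for example $p(x) = -x$, $q(x,s) = s$ and $s_n = \pm\sqrt{\epsilon}$) is an assumption made only for experimentation.

```python
import random

def simulate_chain(x0, eps, n_steps, p, q, noise):
    """Iterate X_{n+1} = X_n + eps*p(X_n) + q(X_n, s_n), as in (1.6)."""
    xs = [x0]
    for n in range(n_steps):
        x = xs[-1]
        xs.append(x + eps * p(x) + q(x, noise[n]))
    return xs

def interpolate(xs, eps, t):
    """Piecewise linear interpolation that equals X_n at t = eps*n."""
    n = min(int(t / eps), len(xs) - 2)   # index of the interval containing t
    frac = (t - n * eps) / eps           # relative position inside the interval
    return xs[n] + frac * (xs[n + 1] - xs[n])

def slope(xs, eps, n, p, q, noise):
    """Slope of the interpolation on [eps*n, eps*n + eps)."""
    return p(xs[n]) + q(xs[n], noise[n]) / eps
```

The slope returned by `slope` is the right-hand side of (1.5) evaluated at the frozen state $X^\epsilon_n$, with $p$ playing the role of $g$ and $q$ the role of $f$, which is the sense in which (1.5) is a continuous parameter version of the sequence.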

Let us suppose that in (1.1), $\alpha(\epsilon) = \epsilon$ and $y^\epsilon(t) = y(t/\epsilon^2)$. Equation (1.5) differs from equation (1.1) in that the $\xi^\epsilon(t)$ essentially become small in some sense as $\epsilon \to 0$. But the scaling is also different (less compression), $t/\epsilon$ being used in lieu of $t/\epsilon^2$. Equation (1.5) (and (1.6)) correspond to a problem where, as $\epsilon \to 0$, more and more random effects affect the system, but where the individual effects become smaller and smaller. Let $f(x,s) = f(x)s$ and write $\xi^\epsilon(t)/\epsilon = [s^\epsilon(t/\epsilon)/\sqrt{\epsilon}]/\sqrt{\epsilon}$, bringing the form (1.5) into that of (1.1) but with $\sqrt{\epsilon}$ replacing $\epsilon$ and $s^\epsilon(t/\epsilon)/\sqrt{\epsilon}$ replacing $y^\epsilon(t)$. But now, as $\epsilon \to 0$, $s^\epsilon(t/\epsilon)/\sqrt{\epsilon}$ might become unbounded. Owing to this, the methods used for (1.1) (at least when $y(\cdot)$ was assumed bounded) need to be modified a little for use here.

In Section 2, we discuss a currently important problem concerning "adaptive" antenna arrays, which illustrates one particular value derived from the type of limit results with which we are concerned. Sections 3 and 4 describe Kurtz's [8] interesting method for proving tightness and weak convergence of a sequence of not necessarily Markov processes. Henceforth, $x^\epsilon$ is used only for the solution to (1.5). Convergence of the finite dimensional distributions of $x^\epsilon(\cdot)$ to those of a particular diffusion $x(\cdot)$ is proved in Section 5. Also, Section 5 proves tightness of $\{x^\epsilon(\cdot)\}$. Together these results yield that $x^\epsilon(\cdot)$ converges weakly to $x(\cdot)$. In Section 6, we return to the antenna problem, treat the problem of weak convergence, get a moment estimate for the adapting parameters, and discuss a related stability problem.

The use of weak convergence methods seems quite natural for our problem. Often w.p.1 results are meaningless, since usually only one system (a fixed $\epsilon$) is to be studied, and we seek approximations to its properties in terms of $\epsilon$ and properties of the limit.

2.  A Problem  in  Adaptive  Antenna  Arrays 

Let $n(\cdot) = (n_1(\cdot), \ldots, n_r(\cdot))$ denote a "wide band" complex valued stochastic process. We are given an array of $r$ antennas with received signal plus noise $v(t) = s(t) + n(t) = \{s_i(t) + n_i(t)\}$, $s_i(t)$ and $n_i(t)$ being complex valued. The $v(t)$ is multiplied by a complex valued weight $w$, and the object is to find the weights which maximize the ratio of signal to noise power in the output $w'v(t)$. The problem is important and of great current interest (see the papers in [9] or [10] and references contained therein). The signal frequency is known, and the signals received by the antennas differ only in the phase. Let $*$ denote complex conjugate. Let $S_0 = (1, \exp i\phi_2, \ldots, \exp i\phi_r)$, where $\phi_j$ is the phase of $s_j(t)$ relative to that of $s_1(t)$, and let $S$ be proportional to $S_0$. With $M = En^*(t)n'(t)$, the optimum weight is $w = kM^{-1}S^*$, for any constant $k > 0$.

In many applications, $M$ is time varying, due to deliberate jamming attempts, or due to more natural phenomena. In fact, in many applications $n(\cdot)$ is a strong competing signal which we wish to "tune out" and its covariance may vary, depending on the particular use to which the system is put. We suppose (as is often the case, e.g., in pulsed radar) that the signal power is much less than the noise power, so that $M \approx Ev^*(t)v'(t)$.

A very useful and relatively simple mechanism for adapting the weights (see, e.g., [10]) can be constructed. The relevant equation is ($M_t = v^*(t)v'(t)$)

(2.1)  $\tau\dot w + (GM_t + I)w = G_0 S^*$,

where $\tau$ is a scalar system time constant and $G$ and $G_0$ are system gains. Since $M_t$ is the "square" of a wide band process, if the bandwidth (BW) goes to infinity and the energy per unit BW does not go to zero, (2.1) becomes meaningless. In practice, we are interested in both $Ew_t$ and in an equation for an approximation to $w_t - Ew_t$ for wide BW noise.

A commonly used "engineering" heuristic argument says that since $M(\cdot)$ is wide band and $w(\cdot)$ is much smoother than $M(\cdot)$, the two are essentially independent and $EM(t)w(t) = EM(t)Ew(t)$, and that $Ew_t$ approximately equals $\bar w_t$, the solution to (with $\bar M = EM_t$)

(2.2)  $\tau\dot{\bar w} + (G\bar M + I)\bar w = G_0 S^*$.

Of course (2.2) does not give the correct value of $Ew_t$, even as an approximation, unless the energy per unit BW of the noise is very small. To see this, simply consider the scalar case where $\tau\dot w + (Gn_t^2 + 1)w = G_0$; solve it and take expectations. Since (2.2) is widely used, we must find an interpretation with respect to which it makes sense. If (2.2) is an asymptotic result, then it must be satisfied by a limit of solutions to (2.1) (or their expectations), as some parameter tends to, say, $\infty$. The comments below are illustrative of the usefulness of the limit results to which this paper (and references [3]-[7]) are devoted.
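
The gap between $Ew_t$ and the heuristic solution can be seen in a deliberately simplified scalar computation. The sketch below is not the paper's model: it assumes the homogeneous equation $\tau\dot w + (Gm_t + 1)w = 0$, $w(0) = 1$, with $m_t$ (standing in for $n_t^2$) piecewise constant on intervals of length $h$ and independent from interval to interval. All names are hypothetical.

```python
import math

def mean_weight(t, h, G, tau, values, probs):
    """E w(t) for tau*dw/dt + (G*m_t + 1)*w = 0, w(0) = 1.

    m_t is piecewise constant on intervals of length h, taking the given
    values with the given probabilities independently on each interval, so
    the expectation factorizes over the intervals.
    """
    n_steps = round(t / h)
    per_step = sum(p * math.exp(-(G * m + 1.0) * h / tau)
                   for m, p in zip(values, probs))
    return per_step ** n_steps

def heuristic_weight(t, G, tau, values, probs):
    """Solution of the averaged equation tau*dw/dt + (G*E[m] + 1)*w = 0."""
    m_bar = sum(p * m for m, p in zip(values, probs))
    return math.exp(-(G * m_bar + 1.0) * t / tau)
```

By Jensen's inequality `mean_weight` always dominates `heuristic_weight`. The two agree only as $h \to 0$ with the values of $m_t$ held bounded, which in this toy setting is the regime where the energy per unit bandwidth of the fluctuation vanishes.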


Let $\sigma^2$ denote the sum of the eigenvalues of $\bar M$. In practice, often a rough estimate of a quantity proportional to $\sigma^2$ (the noise power) is made, and an automatic gain control mechanism is used to adjust $G$, usually decreasing $G$ as the estimate of $\sigma^2$ increases. Such a mechanism is crucial to the proper scaling of (2.1), and we assume its use. In fact, suppose that for some number $K$, $G = K/\sigma^2$. Then (approximately, actually, since we ignore the "signal" component of $M$) with $\delta M = M - \bar M$,

(2.3)  $\tau\dot w + [K\bar M/\sigma^2 + K(\delta M)/\sigma^2 + I]w = G_0 S^*$.


As the BW of $n(\cdot)$ tends to $\infty$, the effects of $K(\delta M)w/\sigma^2$ become negligible (a consequence of the type of argument in [7], Sections 6 and 7, under reasonable assumptions on $n(\cdot)$) and the limit is precisely the solution of (2.2). For concreteness, we consider the case arising from (1.1) where $n_t = y(t/\epsilon^2)/\epsilon$ and $y(\cdot)$ is a stationary bounded process and $\epsilon = 1/\sigma$. Set $M_0 = Ey^*_t y'_t$, $\delta M^\epsilon_t = [y^*(t/\epsilon^2)y'(t/\epsilon^2) - M_0]$. Write $\delta M_t$ as $\delta M_t = \delta M^\epsilon_t/\epsilon^2$ and use $\delta M_t$ for $\delta M$. Now, define $\delta w^\epsilon_t = \bar w_t - w^\epsilon_t$ and $u^\epsilon = \sigma\,\delta w^\epsilon$. Then

(2.4)  $\tau\dot u^\epsilon + [K\bar M/\sigma^2 + \ldots$
       $\dot u^\epsilon = -\frac{1}{\tau}[KM_0 + \ldots$

As BW $\to \infty$, the effects of ... becomes "white noise", in ... process $B(\cdot)$ such that

(2.5)  $\tau\,du + [KM_0 \ldots$

$Q$ is obtainable by the method of Theorem 5. We return to this problem in Section 6, and deal with the convergence problem and a related stability problem when all quantities are real valued, to simplify the notation.

3.  Convergence  of  Finite  Dimensional  Distributions 

In reference [8], Kurtz gave some fairly general methods for showing convergence to a Markov process of a sequence of non-Markov processes, either in the sense of weak convergence or in the sense of convergence of finite dimensional distributions. In this section and in the next, we briefly describe his method. Later we apply it, together with an idea in [5], [6], to get limit results in a fairly efficient manner.

Sections 3 and 4 are identical to Sections 2 and 3 of [7].



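Let $(\Omega, P, \mathcal{F})$ denote a probability space, $\{\mathcal{F}_t\}$ a nondecreasing sequence of sub $\sigma$-algebras of $\mathcal{F}$, and let $\mathcal{L}$ denote the space of progressively measurable real valued processes $k(\cdot)$ on $[0,\infty)$, adapted to $\{\mathcal{F}_t\}$ and such that $\sup_t E|k(t)| < \infty$. Let $k^\epsilon$ and $k$ be in $\mathcal{L}$. Define the limit "p-lim" by p-lim$_\epsilon\, k^\epsilon = k$ iff $\sup_{\epsilon>0}\sup_t E|k^\epsilon(t)| < \infty$ and $E|k^\epsilon(t) - k(t)| \to 0$ for each $t$ as $\epsilon \to 0$. For each $s \ge 0$, define the operator $\mathcal{T}(s): \mathcal{L} \to \mathcal{L}$ by $\mathcal{T}(s)k$ = function in $\mathcal{L}$ whose value at $t$ is the random variable $E_{\mathcal{F}_t}k(t+s)$. There is a version which is progressively measurable ([8], Appendix) and we always assume that this is the one which is used. The $\mathcal{T}(s)$, $s \ge 0$, are a semigroup of linear operators on $\mathcal{L}$. Let $\mathcal{L}_0$ denote the subspace of p-right continuous functions. If the limit p-lim$_{s\to0}\,[\frac{1}{s}(\mathcal{T}(s)k - k)]$ exists and is in $\mathcal{L}_0$, we call it $\hat A k$ and say that $k \in \mathcal{D}(\hat A)$. The operators $\mathcal{T}(s)$ and $\hat A$ are analogous to the semigroup and weak infinitesimal operator of a Markov process. Among the properties to be used later is ([8], equation (1.9))

(3.1a)  $\mathcal{T}(s)k - k = \int_0^s \mathcal{T}(\tau)\hat A k\,d\tau$,  $k \in \mathcal{D}(\hat A)$,

or, equivalently,

(3.1b)  $E_{\mathcal{F}_t}k(t+s) - k(t) = \int_0^s E_{\mathcal{F}_t}\hat A k(t+\tau)\,d\tau$,  for each $t \ge 0$.

If, for some process $Z^\epsilon(\cdot)$, $\mathcal{F}_t = \mathcal{F}^\epsilon_t = \mathcal{B}(Z^\epsilon_s, s \le t)$, we may write $\mathcal{L}^\epsilon$, $\mathcal{T}^\epsilon$ and $\hat A^\epsilon$ for $\mathcal{L}$, $\mathcal{T}(t)$ and $\hat A$, resp.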
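
To make the operators $\mathcal{T}(s)$ and $\hat A$ concrete, the sketch below checks (3.1a) in the simplest Markov setting. It assumes, for illustration only, a two-state continuous-time Markov chain $X$ with jump rates $a$ (state 0 to 1) and $b$ (state 1 to 0) and $k(t) = \phi(X_t)$; in that case $E_{\mathcal{F}_t}k(t+s)$ is a function of $X_t$ alone and $\hat A$ reduces to the generator of the chain. The function names are hypothetical.

```python
import math

def shift_semigroup(phi, a, b, s):
    """E[phi(X_{t+s}) | X_t = i] for i = 0, 1 (the value of T(s)k at t)."""
    pi0, pi1 = b / (a + b), a / (a + b)          # stationary distribution
    mean = pi0 * phi[0] + pi1 * phi[1]
    decay = math.exp(-(a + b) * s)
    return [mean + decay * (phi[0] - mean), mean + decay * (phi[1] - mean)]

def generator(phi, a, b):
    """Infinitesimal operator of the chain applied to phi."""
    return [a * (phi[1] - phi[0]), b * (phi[0] - phi[1])]

def integral_form(phi, a, b, s, n=2000):
    """Midpoint-rule value of int_0^s T(u)(A phi) du, the r.h.s. of (3.1a)."""
    a_phi = generator(phi, a, b)
    h = s / n
    total = [0.0, 0.0]
    for i in range(n):
        shifted = shift_semigroup(a_phi, a, b, (i + 0.5) * h)
        total = [total[0] + h * shifted[0], total[1] + h * shifted[1]]
    return total
```

Comparing `shift_semigroup(phi, a, b, s)` minus `phi` with `integral_form(phi, a, b, s)` reproduces (3.1a) state by state, and the difference quotient for small `s` approaches `generator(phi, a, b)`. The point of Kurtz's construction is that the same objects remain meaningful when the underlying process is not Markov.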

The following Theorem (a specialization of [8], Theorem 3.11) is our main tool for dealing with (1.1) or (1.5).

Theorem 1. Let $Z^\epsilon(\cdot) = (x^\epsilon(\cdot), \xi^\epsilon(\cdot))$, $\epsilon > 0$, denote a sequence of $R^{r+r'}$ valued right continuous processes, $x(\cdot)$ a ($R^r$-valued) Markov process with semigroup $T_t$ mapping $\hat C_0$ into $\hat C_0$ and which is strongly continuous (sup norm) on $\hat C_0$. For some $\lambda > 0$ and dense set $D$ in $\hat C_0$, let Range$(\lambda - A|_D)$ be dense in $\hat C_0$ (sup norm, $A$ = infinitesimal operator of $x(\cdot)$). Suppose that, for each $k \in D$, there is a sequence $\{k^\epsilon\}$ of progressively measurable functions adapted to $\{\mathcal{F}^\epsilon_t\}$ and such that

(3.2)  p-lim$[k^\epsilon - k(x^\epsilon(\cdot))] = 0$

(3.3)  p-lim$[\hat A^\epsilon k^\epsilon - Ak(x^\epsilon(\cdot))] = 0$.

Then, if $x^\epsilon(0) \to x(0)$ weakly, the finite dimensional distributions of $x^\epsilon(\cdot)$ converge to those of $x(\cdot)$.

Equations (3.2) and (3.3) are equivalent to (the limits are taken for each $t$ as $\epsilon \to 0$)

(3.2')  $\sup_{\epsilon,t} E|k^\epsilon(t) - k(x^\epsilon(t))| < \infty$,  $E|k^\epsilon(t) - k(x^\epsilon(t))| \to 0$

(3.3')  $\sup_{\epsilon,t} E|\hat A^\epsilon k^\epsilon(t) - Ak(x^\epsilon(t))| < \infty$,  $E|\hat A^\epsilon k^\epsilon(t) - Ak(x^\epsilon(t))| \to 0$.

e,t 

4.  Tightness

Let $\xi^\epsilon(\cdot)$, $x^\epsilon(\cdot)$ denote the functions in the model (1.5). Let $\mathcal{F}^\epsilon_t$ denote $\mathcal{B}(\xi^\epsilon_u, u \le t)$ and write $E^\epsilon_t$ for $E_{\mathcal{F}^\epsilon_t}$.

Again, we describe results from [8]. Let $D^r[0,\infty)$ denote the space of $R^r$ valued functions on $[0,\infty)$ which are right continuous on $[0,\infty)$ and have left hand limits on $(0,\infty)$. Note that $x^\epsilon(\cdot) \in D^r[0,\infty)$ w.p.1. Suppose that the finite dimensional distributions of $x^\epsilon(\cdot)$ converge to those of a process $x(\cdot)$, where $x(\cdot)$ has paths in $D^r[0,\infty)$ w.p.1. Then, as noted in [8], bottom of page 628, $\{x^\epsilon(\cdot)\}$ is tight in $D^r[0,\infty)$ if $\{k(x^\epsilon(\cdot))\}$ is tight in $D[0,\infty)$ for each $k \in \hat C_0$. ($\hat C_0$ is used there, but it can be replaced by $\hat C^3$ or by any set of functions dense in $\hat C_0$ in the sup norm.) It follows from [8], Theorem 4.20, that $\{k(x^\epsilon(\cdot))\}$ is tight in $D[0,\infty)$ if $x^\epsilon_0 \to x_0$ weakly and if, for each real $T > 0$, there is a random variable $\gamma_\epsilon(\delta)$ such that

(4.1)  $E^\epsilon_t\gamma_\epsilon(\delta) \ge E^\epsilon_t \min\{1, [k(x^\epsilon_{t+u}) - k(x^\epsilon_t)]^2\}$,

for all $0 \le t \le T$, $0 \le u \le \delta \le 1$, and

(4.2)  $\lim_{\delta\to0}\,\overline{\lim}_{\epsilon\to0}\, E\gamma_\epsilon(\delta) = 0$.

In [8], p. 629, Kurtz suggests a method of getting the $\gamma_\epsilon(\delta)$. This method is developed in Theorem 2 and is used in the sequel. The $k^\epsilon$ below will be obtained in the same manner as we will obtain the $k^\epsilon$ needed in Theorem 1. We have ($\|k\| = \sup_x |k(x)|$)

(4.3)  $E^\epsilon_t[k(x^\epsilon_{t+u}) - k(x^\epsilon_t)]^2 \le 2\|k\|\,|E^\epsilon_t k(x^\epsilon_{t+u}) - k(x^\epsilon_t)| + |E^\epsilon_t k^2(x^\epsilon_{t+u}) - k^2(x^\epsilon_t)|$.
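
The inequality (4.3) follows from expanding the square. The short derivation below spells out that step, using the abbreviations $a = k(x^\epsilon_{t+u})$ and $b = k(x^\epsilon_t)$, which are introduced here only for readability; the one fact used is that $b$ is $\mathcal{F}^\epsilon_t$-measurable and bounded by $\|k\|$.

```latex
\begin{align*}
E^\epsilon_t (a-b)^2
  &= E^\epsilon_t a^2 - 2b\,E^\epsilon_t a + b^2 \\
  &= \bigl(E^\epsilon_t a^2 - b^2\bigr) - 2b\,\bigl(E^\epsilon_t a - b\bigr) \\
  &\le \bigl|E^\epsilon_t a^2 - b^2\bigr| + 2\|k\|\,\bigl|E^\epsilon_t a - b\bigr| .
\end{align*}
```

Both terms on the right are conditional increments, of $k^2$ and of $k$ respectively, which is why conditions on the first and second powers of the approximating functions enter the tightness argument.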

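Theorem 2. Let $k \in \hat C^3$, and let there be a sequence $\{k^\epsilon\}$ in $\mathcal{L}^\epsilon$, where $(k^\epsilon)^i \in \mathcal{D}(\hat A^\epsilon)$, $i = 1,2$, and such that, for each real $T > 0$ there is a random variable $M^\epsilon_T$ such that

(4.4)  $\sup_{t \le T} |k^\epsilon(t) - k(x^\epsilon_t)| \to 0$ in probability as $\epsilon \to 0$,

(4.5)  $\sup_{t \le T} |\hat A^\epsilon (k^\epsilon(t))^i| \le M^\epsilon_T$, $i = 1,2$,  $\lim_{N\to\infty}\sup_\epsilon P\{M^\epsilon_T \ge N\} = 0$.

Then $\{k(x^\epsilon(\cdot))\}$ is tight in $D[0,\infty)$.

Proof: We need only show tightness of $\{k^\epsilon(\cdot)\}$ on $[0,T]$. By (3.1)

$E^\epsilon_t(k^\epsilon(t+u))^i - (k^\epsilon(t))^i = \int_0^u E^\epsilon_t \hat A^\epsilon (k^\epsilon(t+\tau))^i\,d\tau$,

from which (4.3) and (4.5) yield that there is a $\gamma_\epsilon(\delta)$ satisfying (4.1) and (4.2) for $\{k^\epsilon(\cdot)\}$. Then (4.4) and tightness of $\{k^\epsilon(\cdot)\}$ imply tightness of $\{k(x^\epsilon(\cdot))\}$. Q.E.D.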

5.  Convergence of the Sequence $\{x^\epsilon(\cdot)\}$ of (1.5)

We follow the general line of development in [7], using the ideas in Sections 3 and 4.

Assumptions:

(A1)  Let $f(x,s) = f(x)s$. The functions $g(\cdot,\cdot)$ and $f(\cdot)$ are continuous, the first (second, resp.) function having continuous first (second, resp.) mixed partial $x$-derivatives.

(A2)  There is a constant $K$ such that $|f(x)| + |g(x,s)| \le K(1 + |x|)$.

(A3)  $\{s^\epsilon_j\}$ is bounded (uniformly in $\epsilon$) and stationary and $Es^\epsilon_j = 0$.

Define $\mathcal{F}^\epsilon_i = \mathcal{B}(s^\epsilon_j, j \le i)$, $\mathcal{F}^\epsilon_t = \mathcal{B}(\xi^\epsilon_s, s \le t)$ and let $E^\epsilon_i$ and $E^\epsilon_t$ denote the corresponding conditional expectation operators. Throughout the sequel, $K$ denotes a constant, whose value may change from usage to usage. Also in (A4b,c), we let $s^\epsilon_i$ denote an arbitrary scalar component of itself. It seems to be most convenient to use (A4) in its given form. Other forms, more closely resembling strong mixing, can be given. Strong mixing would imply the second part of (A4a) and (A4b,d). But something like (A4c) would still be required, because we need the $\epsilon$ dependence there.


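(A4)  a. There are $\mu_j$ and $\delta > 0$ such that $\sum_j \mu_j^{1/2} < \infty$ and
         $E|s^\epsilon_i|^2 \le K\epsilon$,  $|E^\epsilon_i s^\epsilon_{i+j}| \le K\mu_j$.

      b. $|E^\epsilon_i s^\epsilon_{i+j}s^\epsilon_{i+j+k} - Es^\epsilon_{i+j}s^\epsilon_{i+j+k}| \le K\mu_j$.

      c. $E^{1/2}|E^\epsilon_i s^\epsilon_{i+j}s^\epsilon_{i+j+k} - Es^\epsilon_{i+j}s^\epsilon_{i+j+k}|^2 \le K\mu_j\epsilon^{1/2+\delta}$.

      d. $\sum_j |E^\epsilon_i g(x,s^\epsilon_{i+j}) - Eg(x,s^\epsilon_{i+j})| < \infty$,
         $\sum_j |E^\epsilon_i g_x(x,s^\epsilon_{i+j}) - Eg_x(x,s^\epsilon_{i+j})| < \infty$.

      Let the expectation of the sums in (d) go to zero as $\epsilon \to 0$, uniformly on bounded $x$-sets.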

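Define the operators $A^\epsilon_t$ and $\bar A^\epsilon$ by ($k(\cdot) \in \hat C^2$)

(5.1)  $A^\epsilon_t k(x) = E\int_0^\infty f'(x,\xi^\epsilon_t)\,[k'_x(x)f(x,\xi^\epsilon_{t+u})]_x\,du/\epsilon^2$

       $= E\int_0^\infty f'(x,s^\epsilon(t/\epsilon))\,[k'_x(x)f(x,s^\epsilon(u + t/\epsilon))]_x\,du/\epsilon$

and

       $\bar A^\epsilon k(x) = \lim_{N\to\infty}\frac{1}{N}\int_0^N A^\epsilon_{t+s}k(x)\,ds$.

Let $i = [t/\epsilon]$, which equals the nearest integer to $t/\epsilon$ which is not larger than $t/\epsilon$. Then

(5.2)  $A^\epsilon_t k(x) = \frac{1}{\epsilon}\sum_{j=1}^\infty Ef'(x,s^\epsilon_0)\,(k'_x(x)f(x,s^\epsilon_j))_x + \frac{1}{\epsilon}Ef'(x,s^\epsilon_0)\,(k'_x(x)f(x,s^\epsilon_0))_x\,(\epsilon[t/\epsilon] + \epsilon - t)/\epsilon$.

Note that $\bar A^\epsilon$ is $A^\epsilon_t$ but with $(\epsilon[t/\epsilon] + \epsilon - t)/\epsilon$ replaced by $1/2$.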
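
The factor $1/2$ in $\bar A^\epsilon$ is simply the time average of the sawtooth weight $(\epsilon[t/\epsilon] + \epsilon - t)/\epsilon$ multiplying the $j = 0$ term of (5.2). A minimal numerical sketch of that averaging step follows; the function names and the midpoint rule are illustrative choices, not part of the paper.

```python
import math

def boundary_weight(t, eps):
    """(eps*[t/eps] + eps - t)/eps, the weight of the j = 0 term in (5.2)."""
    return (eps * math.floor(t / eps) + eps - t) / eps

def time_average(eps, horizon, n=100000):
    """Midpoint-rule approximation of (1/N) * int_0^N weight(t) dt, N = horizon."""
    h = horizon / n
    return sum(boundary_weight((i + 0.5) * h, eps) for i in range(n)) / n
```

The weight decreases linearly from 1 to 0 over each interval of length $\epsilon$ and then resets, so its average over any whole number of such intervals is $1/2$.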

(A5)  The sum in (5.2) converges uniformly in $\epsilon$ for each $x$ and $k(\cdot)$. (Hence also uniformly in $x$, since $f(x,s) = f(x)s$.) There are matrix and vector valued functions $a(\cdot)$ and $b(\cdot)$, resp., such that ($\bar g^\epsilon(x) = Eg(x,s^\epsilon_i)$)

(5.3)  $\bar A^\epsilon k(x) + k'_x(x)\bar g^\epsilon(x) \to b'(x)k_x(x) + \frac{1}{2}\sum_{i,j} a_{ij}(x)\,\partial^2 k(x)/\partial x_i\,\partial x_j \equiv Ak(x)$

uniformly in $x$ for each $k$.

(A6)  $A$ is the restriction to $\hat C^2$ of the strong infinitesimal operator of a strong Markov diffusion process (with no finite escape time) with semigroup $T_t$, which maps $\hat C_0$ into $\hat C_0$ and is strongly continuous on $\hat C_0$.

(A7)  For some real $\lambda > 0$, the set $(\lambda - A)\hat C^2$ is dense in $\hat C_0$.


We  note  that  it  is  enough  to  use 


in  (A5)  and  (A6) . 


Remark on bounding the coefficients. In our case (since $f(x,s) = f(x)s$) there is a matrix $\sigma(\cdot)$ such that $a(\cdot) = \sigma(\cdot)\sigma'(\cdot)$. Suppose that $b$, $\sigma$ are locally Lipschitz and that $x(\cdot)$, the diffusion with coefficients $a(\cdot)$, $b(\cdot)$, has no finite escape time w.p.1. Then $x(\cdot)$ has a representation as a solution to an Ito equation, and can be defined on Wiener process space. We say that $x^N(\cdot)$ is an $N$-truncation of $x(\cdot)$ if its drift and diffusion coefficients $b^N(\cdot)$, $\sigma^N(\cdot)$ are bounded and equal $b(\cdot)$, $\sigma(\cdot)$ in $S_N = \{x: |x| \le N\}$, and are obtained by a bounding of $g$ and $f$. The process $x^N(\cdot)$ can be defined on the same Wiener process space on which $x(\cdot)$ is defined, and then equals $x(\cdot)$ up until the first exit time from $S_N$ (which $\to \infty$ w.p.1 as $N \to \infty$). If for each $N$, Theorem 2 (and, of course, (A6)-(A7)) is true for some $N$-truncation, then it is true as stated.
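
One simple way to build a bounded coefficient that agrees with the original one on $S_N$ is to compose it with the projection onto $S_N$. The scalar sketch below illustrates this; it is only one possible construction, chosen here for brevity, and the names `project` and `truncate` are hypothetical.

```python
def project(x, N):
    """Projection of a scalar x onto S_N = {x : |x| <= N}."""
    return max(-N, min(N, x))

def truncate(coef, N):
    """Return a function equal to `coef` on S_N and constant beyond it.

    If `coef` is continuous the result is bounded, and if `coef` is locally
    Lipschitz the result is globally Lipschitz, because the projection is
    1-Lipschitz and maps into a compact set.
    """
    return lambda x: coef(project(x, N))
```

For example, with the hypothetical drift $b(x) = -x^3$ and $N = 2$, the truncated drift coincides with $b$ on $[-2, 2]$ and is frozen at $b(\pm 2) = \mp 8$ outside it. The same wrapper can be applied to $g(\cdot, s)$ and $f(\cdot)$.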


Remark on the conditions. (A6) and (A7) are required for the use of (the semigroup approximation) Theorem 1. (A7) is equivalent to the strong infinitesimal operator of the $T_t$ of (A6) being the closure of the operator $A$ of (5.3) acting on $\hat C^2$ (or on any set dense in $\hat C^2$ in the norm $\|k\|_2 \equiv \sup_x(|k(x)| + |k_x(x)| + |k_{xx}(x)|)$). This condition does not seem to be very restrictive. As noted above, it is often only necessary to verify it for some truncation $x^N(\cdot)$ for each $N$.

If the coefficients are bounded and uniformly twice continuously differentiable, then ([11], Chapter 8.4) $T_t$ maps $\hat C_0^2$ into $\hat C_0^2$ and is strongly continuous on $\hat C_0^2$ with respect to the norm $\|\cdot\|_2$. We can then replace $\hat C_0$ by $\hat C_0^2$ in (A5), (A6) and in Theorem 1 and consider $T_t$ as acting on $\hat C_0^2$ rather than on $\hat C_0$.

In Theorem 3, we apply Theorem 1 to get convergence of finite dimensional distributions of $\{x^\epsilon(\cdot)\}$ to those of $x(\cdot)$ on an arbitrary time interval $[0,T]$. Tightness (Theorem 4) is a little harder to get, owing to the fact that $f(x,\xi^\epsilon_t)/\sqrt{\epsilon}$ is not necessarily uniformly bounded as $\epsilon \to 0$ (see remarks in Section 1), and some additional conditions must be used.

Theorem 3. Under (A1) to (A7), the finite dimensional distributions of $x^\epsilon(\cdot)$ converge to those of $x(\cdot)$ as $\epsilon \to 0$, where $x(\cdot)$ is the diffusion with infinitesimal operator $A$ (on $\hat C^2$ functions) and initial condition $x_0$.

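Remark. Given $k \in \hat C^3$, the main work in using Theorem 1 is in finding a suitable $k^\epsilon$ and proving (3.2)-(3.3). Following a basic idea in [5], [6], we will define functions $k^\epsilon_1$ and $k^\epsilon_2$ such that $k^\epsilon = k^\epsilon_0 + k^\epsilon_1 + k^\epsilon_2$ (with $k^\epsilon_0 = k(x^\epsilon(\cdot))$) does the job. The $k^\epsilon_i$, $i = 1,2$, are constructed to guarantee that p-lim $k^\epsilon_i = 0$ and $\hat A^\epsilon k^\epsilon = Ak$ plus terms going to zero in the sense of p-lim.

Proof: Part 1. Let $k \in \hat C^3$. Then it is easy to check that $k^\epsilon_0(\cdot) = k(x^\epsilon(\cdot)) \in \mathcal{D}(\hat A^\epsilon)$ and that (write $x = x^\epsilon_t$, $\xi = \xi^\epsilon_t$)

$\hat A^\epsilon k^\epsilon_0(t) = k'_x(x)[g(x,\xi) + f(x,\xi)/\epsilon]$.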

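Part 2. We now define $k^\epsilon_1(t) = k^\epsilon_1(x^\epsilon_t, t)$ in such a way that $\hat A^\epsilon k^\epsilon_1(t)$ cancels the $k'_x(x)f(x,\xi)/\epsilon$ term of $\hat A^\epsilon k^\epsilon_0(t)$. Define

(5.4)  $k^\epsilon_1(x,t) = \int_0^\infty E^\epsilon_t k'_x(x)\,(f(x,\xi^\epsilon_{t+s})/\epsilon)\,ds$.

By (A3), (A4), ($i = [t/\epsilon]$)

(5.5)  $|k^\epsilon_1(x,t)| \le \sum_{j=0}^\infty |E^\epsilon_i k'_x(x)f(x,s^\epsilon_{i+j})| \le K\sum_j \mu_j$.

Thus, $k^\epsilon_1$ is bounded. By (A4a), $E|E^\epsilon_i k'_x(x)f(x,s^\epsilon_{i+j})| \le K\sqrt{\epsilon}$. Thus the expectation of the sum in (5.5) is bounded above by $K\epsilon^{1/4}\sum_j \mu_j^{1/2}$ and, hence, p-lim $k^\epsilon_1 = 0$.

Note that $k^\epsilon_1(x,t)$ is differentiable with respect to $x$. In fact,

(5.6)  $k^\epsilon_{1,x}(x,t) = \int_0^\infty E^\epsilon_t\,(k'_x(x)f(x,\xi^\epsilon_{t+s}))_x\,ds/\epsilon$,

which is well defined for each $\epsilon$. We can readily show that $\sup_{\delta>0} |E^\epsilon_t k^\epsilon_1(x^\epsilon_{t+\delta}, t+\delta) - k^\epsilon_1(x^\epsilon_t, t)|/\delta$ is bounded as $\delta \to 0$ and that

(5.7)  p-lim$_\delta\,[E^\epsilon_t k^\epsilon_1(x^\epsilon_{t+\delta}, t+\delta) - k^\epsilon_1(x^\epsilon_t, t)]/\delta$

       $=$ p-lim$_\delta\,[E^\epsilon_t k^\epsilon_1(x^\epsilon_{t+\delta}, t+\delta) - k^\epsilon_1(x^\epsilon_t, t+\delta)]/\delta$
         $+$ p-lim$_\delta\,[E^\epsilon_t k^\epsilon_1(x^\epsilon_t, t+\delta) - k^\epsilon_1(x^\epsilon_t, t)]/\delta$

       $= k^\epsilon_{1,x}(x^\epsilon_t, t)'\dot x^\epsilon_t - k'_x(x^\epsilon_t)f(x^\epsilon_t, \xi^\epsilon_t)/\epsilon$,

where $\dot x^\epsilon_t = g(x^\epsilon_t, \xi^\epsilon_t) + f(x^\epsilon_t, \xi^\epsilon_t)/\epsilon$. Thus $k^\epsilon_1 \in \mathcal{D}(\hat A^\epsilon)$ and $\hat A^\epsilon k^\epsilon_1$ is given by (5.7). The second term on the r.h.s. of (5.7) cancels a term of $\hat A^\epsilon k^\epsilon_0$. Also the term in $k^\epsilon_{1,x}(x^\epsilon_t, t)'\dot x^\epsilon_t$ which arises from the $g$ term goes to zero in the p-lim sense as $\epsilon \to 0$.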

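Part 3. Now $k^\epsilon_2$ is to be defined such that $\hat A^\epsilon k^\epsilon_2(t)$ equals $Ak(x^\epsilon_t)$ minus the dominant term in $k^\epsilon_{1,x}(x^\epsilon_t, t)'\dot x^\epsilon_t$ plus terms which go to zero in the p-lim sense as $\epsilon \to 0$. Set $k^\epsilon_2(t) = k^\epsilon_2(x^\epsilon_t, t)$, where

(5.8)  $k^\epsilon_2(x,t) = \int_0^\infty ds\,\Big\{E^\epsilon_t\int_0^\infty f'(x,\xi^\epsilon_{t+s})\,[k'_x(x)f(x,\xi^\epsilon_{t+s+u})]_x\,du/\epsilon^2 - A^\epsilon_{t+s}k(x)\Big\}$

       $+ \int_0^\infty E^\epsilon_t k'_x(x)\,(g(x,\xi^\epsilon_{t+s}) - \bar g^\epsilon(x))\,ds \equiv \tilde k^\epsilon_2(x,t) + \tilde k^\epsilon_3(x,t)$.

The last term of (5.8) goes to zero uniformly in $x$, as $\epsilon \to 0$; also (with $x = x^\epsilon_t$) it is in $\mathcal{D}(\hat A^\epsilon)$ and $\hat A^\epsilon \tilde k^\epsilon_3(x,t) = k'_x(x)\bar g^\epsilon(x)$ plus terms going to zero in the p-lim sense minus $k'_x(x)g(x,\xi^\epsilon_t)$. This last term cancels a term of $\hat A^\epsilon k^\epsilon_0(t)$. We need to deal only with the first term of (5.8). Recall that $A^\epsilon_{t+s}k(x)$ is just the expected value of the integral with respect to $u$ of the coefficient of $E^\epsilon_t$. Owing to the integration with respect to $s$ in (5.8), $A^\epsilon_{t+s}$ can (and will) be replaced by $\bar A^\epsilon$ with little required change in the subsequent arguments. Next, we show that $\tilde k^\epsilon_2$ is well-defined and p-lim $\tilde k^\epsilon_2 = 0$. To simplify the notation, assume the scalar case and let $f^\epsilon_j$ represent either $f(x,s^\epsilon_j)$ or $f_x(x,s^\epsilon_j)$. All bounds and convergences are uniform on bounded $x$-sets.

We have ($i = [t/\epsilon]$)

$|\tilde k^\epsilon_2(t)| \le K\sum_{j,k} |E^\epsilon_i f^\epsilon_{i+j}f^\epsilon_{i+j+k} - Ef^\epsilon_{i+j}f^\epsilon_{i+j+k}|$.

By (A4a,b) the $(j,k)$th term is bounded above by $K\mu_j$ and also by $K\mu_k$, hence by $K(\mu_j\mu_k)^{1/2}$, which implies that the sum is bounded. Also, by (A4a,c), p-lim $\tilde k^\epsilon_2 = 0$. It can also be readily shown that $\tilde k^\epsilon_2 \in \mathcal{D}(\hat A^\epsilon)$. We need only show that $\hat A^\epsilon \tilde k^\epsilon_2$ = minus argument of the outer integral of $\tilde k^\epsilon_2$ (at $s = 0$) plus $\tilde k^\epsilon_{2,x}(x^\epsilon_t, t)'\dot x^\epsilon_t$ and that p-lim $\tilde k^\epsilon_{2,x}(x^\epsilon_t, t)'\dot x^\epsilon_t = 0$. If this is true, then we have $\hat A^\epsilon(k(x^\epsilon_t) + k^\epsilon_1(t) + k^\epsilon_2(t)) = Ak(x^\epsilon_t)$ plus terms whose p-lim is zero, and the proof will be completed. That $\hat A^\epsilon \tilde k^\epsilon_2$ has the asserted form is not hard to show, and we only show that p-lim $(\dot x^\epsilon)'\tilde k^\epsilon_{2,x} = 0$.

To get $\tilde k^\epsilon_{2,x}$, we can differentiate under the integral in (5.8). With the use of the above $f^\epsilon_j$ notation, $\tilde k^\epsilon_{2,x}(t)'\dot x^\epsilon_t$ is bounded by a finite (the number not depending on $\epsilon$) sum of terms of the type ($i = [t/\epsilon]$)

(5.9)  $Q^\epsilon = \frac{|f^\epsilon_i|}{\epsilon}\sum_{j,k} |E^\epsilon_i f^\epsilon_{i+j}f^\epsilon_{i+j+k} - Ef^\epsilon_{i+j}f^\epsilon_{i+j+k}|$.

By (A4), $E^{1/2}|f^\epsilon_j|^2 \le K\epsilon^{1/2}$ and $E^{1/2}|E^\epsilon_i[f^\epsilon_{i+j}f^\epsilon_{i+j+k}]|^2 \le KE^{1/2}|f^\epsilon_{i+j}|^2\mu_k \le K\epsilon^{1/2}\mu_k$. This and (A4c) imply that

$E^{1/2}|E^\epsilon_i f^\epsilon_{i+j}f^\epsilon_{i+j+k} - Ef^\epsilon_{i+j}f^\epsilon_{i+j+k}|^2 \le K\epsilon^{(1+\delta)/2}(\mu_j\mu_k)^{1/2}$.

Thus, by (A4) again, $E|Q^\epsilon| \le K\epsilon^{\delta/2}$; hence, p-lim $(\dot x^\epsilon)'\tilde k^\epsilon_{2,x} = 0$. Q.E.D.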

Tightness. To prove tightness (which, together with the convergence of finite dimensional distributions, yields weak convergence of $\{x^\varepsilon(\cdot)\}$ to $x(\cdot)$), we make use of Theorem 2. Both (4.4) and (4.5) must be shown for each $k \in \hat C^3$. They do not follow from (A1)-(A7).

In order to simplify matters we use (A8) also.


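(A8)  Let $s_i^\varepsilon = L^\varepsilon w_i^\varepsilon$, where $L^\varepsilon$ is a matrix (bounded uniformly in $\varepsilon$) and $\{w_i^\varepsilon, i \ge 0\}$ is a Markov chain for each $\varepsilon$. There are matrices $P_j^\varepsilon$ such that $\sum_j |P_j^\varepsilon| < \infty$ and where the convergence is uniform in $\varepsilon$, and $E_i^\varepsilon w_{i+j}^\varepsilon = P_j^\varepsilon w_i^\varepsilon$. We use $E_i^\varepsilon$ to denote conditioning on $\{w_j^\varepsilon, j \le i\}$. Let

$$E|w_j^\varepsilon|^7 \le K\varepsilon^{7/2}$$

and

$$|E_i^\varepsilon w_{i+j}^\varepsilon (w_{i+j}^\varepsilon)' - E w_{i+j}^\varepsilon (w_{i+j}^\varepsilon)'| \le |P_j^\varepsilon|\,(|w_i^\varepsilon|^2 + K\varepsilon).$$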

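Remark. Condition (A8) is quite realistic. For example, let $w_{n+1}^\varepsilon = Bw_n^\varepsilon + \beta_n^\varepsilon$, where the eigenvalues of $B$ are inside the unit circle and, for each $\varepsilon$, $\{\beta_n^\varepsilon\}$ is a sequence of truncated independent identically distributed Gaussian random variables with covariance bounded above by $K\varepsilon$. Then (we use scalar case for illustration) $E_i^\varepsilon(w_{i+j}^\varepsilon) = B^j w_i^\varepsilon$,

$$E_i^\varepsilon (w_{i+j}^\varepsilon)^2 = B^{2j}(w_i^\varepsilon)^2 + \sum_{k=0}^{j-1} (\mathrm{var}\,\beta_n^\varepsilon) B^{2k}$$

and

$$|E_i^\varepsilon (w_{i+j}^\varepsilon)^2 - E(w_{i+j}^\varepsilon)^2| \le |B|^{2j}(w_i^\varepsilon)^2 + \sum_{k=j}^{\infty} (\mathrm{var}\,\beta_n^\varepsilon)|B|^{2k},$$

where $\mathrm{var}\,\beta_n^\varepsilon \le K\varepsilon$.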
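The scalar example in the Remark can be checked numerically. The following is a minimal sketch, assuming a zero-mean noise with variance `var_beta` that is independent of the past (a symmetric truncation of a Gaussian keeps the mean at zero). The function names and the numerical values are illustrative assumptions, not taken from the paper. It iterates the conditional first and second moments of the recursion $w_{n+1} = Bw_n + \beta_n$ and compares the distance from the stationary second moment with a geometric bound of the form $|B|^{2j}(w_i^2 + K\varepsilon)$.

```python
def conditional_moments(B, var_beta, w_i, j):
    """Return (E_i w_{i+j}, E_i w_{i+j}^2) for the scalar recursion
    w_{n+1} = B*w_n + beta_n, given w_i, with beta_n zero-mean,
    variance var_beta, independent of w_n (assumption)."""
    mean, second = w_i, w_i * w_i
    for _ in range(j):
        # E[(B w + beta)^2] = B^2 E[w^2] + var_beta; the cross term vanishes.
        second = B * B * second + var_beta
        mean = B * mean
    return mean, second


def stationary_second_moment(B, var_beta):
    """Second moment of the stationary solution, valid for |B| < 1."""
    return var_beta / (1.0 - B * B)


def geometric_bound(B, var_beta, w_i, j):
    """|B|^(2j) * (w_i^2 + stationary second moment)."""
    return abs(B) ** (2 * j) * (w_i * w_i + stationary_second_moment(B, var_beta))


if __name__ == "__main__":
    eps = 1e-2                       # hypothetical value of the small parameter
    B, var_beta, w_i = 0.8, eps, 0.3
    for j in (0, 1, 5, 20):
        mean, second = conditional_moments(B, var_beta, w_i, j)
        gap = abs(second - stationary_second_moment(B, var_beta))
        print(j, mean, gap, geometric_bound(B, var_beta, w_i, j))
```

Since the stationary second moment is at most $K\varepsilon/(1-B^2)$, the bound has the form required by (A8) with a scalar weight decaying like $|B|^{2j}$, which is summable in $j$ uniformly in $\varepsilon$.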


Theorem 4.  Under (A1) to (A8), $\{x^\varepsilon(\cdot)\}$ is tight on $D^r[0,\infty)$ and converges weakly to the diffusion $x(\cdot)$ of Theorem 3 as $\varepsilon \to 0$.


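Proof.  Tightness of $\{k(x^\varepsilon(\cdot))\}$ on $D[0,T]$ needs to be shown, for arbitrary $T$ and $k \in \hat C^3$. Define $k_i^\varepsilon$ as in Theorem 3. Since $k_i^\varepsilon$, $i = 0,1,2$, are bounded, and $k^\varepsilon \in \mathcal{D}(A^\varepsilon)$, then $(k^\varepsilon)^2 \in \mathcal{D}(A^\varepsilon)$ and $A^\varepsilon(k^\varepsilon)^2 = 2k^\varepsilon A^\varepsilon k^\varepsilon$. By Theorem 2 and the estimates of Theorem 3 we only need to show that for each $\delta > 0$ ($i = 1,2$)

$$P\{\sup_{t \le T} |k_i^\varepsilon(t)| > \delta\} \to 0$$

as $\varepsilon \to 0$ and that $(k_{2,x}^\varepsilon)'\dot x^\varepsilon$ is bounded on $[0,T]$ by some random variable $M_\varepsilon(T)$ such that $\sup_\varepsilon P(M_\varepsilon(T) > A) \to 0$ as $A \to \infty$. (All the other components of $A^\varepsilon k^\varepsilon$ have this property by the estimates in Theorem 3.)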

By  (A8)  (using  scalar  case  notation  where  convenient) 


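(5.10)  $$\sup_{t \le T} |k_1^\varepsilon(t)| \le K \sup_{i \le [T/\varepsilon]} \sum_{j=0}^{\infty} |E_i^\varepsilon s_{i+j}^\varepsilon| \le K \sup_{i \le [T/\varepsilon]} |w_i^\varepsilon|,$$

(5.11)  $$\sup_{t \le T} |k_2^\varepsilon(t)| \le K \sup_{i \le [T/\varepsilon]} \sum_{j,k \ge 0} |E_i^\varepsilon s_{i+j}^\varepsilon s_{i+j+k}^\varepsilon - E s_{i+j}^\varepsilon s_{i+j+k}^\varepsilon| \le K \sup_{i \le [T/\varepsilon]} |w_i^\varepsilon|^2 + K\varepsilon.$$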


Similarly 


(5.12)  suplk!;  v(t)'*f|  < 

t-cT 


sil 


t<T 


l 

j,k=0 


..  e c e 

1 1 l+j  i + j+k 


r e e , 
Esi*jsi*j*kl 


< K sup 
i< [T/e] 


I el3 

lw.il 


+ KlwL 


By the comments in the first paragraph and the bounds (5.10)-(5.12), we need only show that

(5.13)  $$P^\varepsilon \equiv P\{\sup_{i \le [T/\varepsilon]} |w_i^\varepsilon| > A\varepsilon^{1/3}\} \to 0$$

as $\varepsilon \to 0$ for each positive A. Bounding (5.13) and using Chebychev's inequality yields

$$P^\varepsilon \le \sum_{i=0}^{[T/\varepsilon]} P\{|w_i^\varepsilon| > A\varepsilon^{1/3}\} \le \sum_{i=0}^{[T/\varepsilon]} \frac{E|w_i^\varepsilon|^7}{A^7\varepsilon^{7/3}} \le K\varepsilon^{1/6},$$

from which we get that (5.10), (5.11) go to zero as desired and that (5.12) is bounded as desired. In fact (5.12) goes to zero in probability as $\varepsilon \to 0$. Q.E.D.
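The power of $\varepsilon$ in the Chebychev bound is plain exponent bookkeeping, and it can be checked with exact fractions. The sketch below assumes a moment bound of the form $E|w|^p \le K\varepsilon^{r}$ and a threshold $A\varepsilon^{q}$, summed over about $T/\varepsilon$ indices. The helper name `union_bound_exponent` and the case $p = 6$ are hypothetical illustrations. Only $p = 7$, $r = 7/2$, $q = 1/3$ comes from (A8) and (5.13).

```python
from fractions import Fraction


def union_bound_exponent(p, moment_rate, threshold_rate):
    """Exponent e such that
        (T/eps) * eps**moment_rate / eps**(p * threshold_rate) = T * eps**e,
    i.e. the power of eps left after a union bound over T/eps terms,
    each estimated by Chebychev's inequality with the p-th moment."""
    return Fraction(moment_rate) - p * Fraction(threshold_rate) - 1


if __name__ == "__main__":
    # The case used in the proof: 7/2 - 7/3 - 1 = 1/6.
    print(union_bound_exponent(7, Fraction(7, 2), Fraction(1, 3)))
    # Hypothetical sixth-moment analogue E|w|^6 <= K*eps^3: exponent 0.
    print(union_bound_exponent(6, Fraction(3), Fraction(1, 3)))
```

With moments scaling like $\varepsilon^{p/2}$ the exponent is $p/6 - 1$, so under that scaling a moment of order strictly greater than six is needed for the bound to vanish. This is consistent with the seventh moment appearing in (A8).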

6.  Convergence and Stability for the Adaptive Antenna Problem

We use the same notation as in Section 2. The symbols $E_t^\varepsilon$ and $E_t$ refer to expectation conditioned on $y(s)$, $s \le t/\varepsilon^2$ and $y(s)$, $s \le t$, resp. Write $K_0 = (KM_0 + I)/\tau$. As in Section 2, $y(\cdot)$ is a (not necessarily piecewise constant) stationary right continuous and bounded process. The quantities are not complex valued here, to simplify the notation. In general, deal with the real and imaginary parts separately. Assume

(B1)  $\int_0^\infty |E_t \delta M_{t+s}|\,ds$ is bounded uniformly in $t, \omega$.

Define  the  operator  A®  (on  CZ  functions)  by  (wt  is  not  random) 

2 f°° 

A?k(u)  = ^ joW'Edfl'kUu(u)^Vtds* 

(B2)  J~ds  - B«Mtts6Hiis,v|dv 

is  bounded  uniformly  in  t . 


(B3)  Either  let  E be  the  expectation  conditioned  on 

^ E 

0 #(  y ( s ) , s < t + p)  (and  similarly  for  E.)  or  else 
P>0 

* Vt.s  r;t*Pyt*s>'t.i  * Vfs’T.t  (22i 

r J 

also  for  3 order  terms)  in  probability  as  P I 0 , for  each 
posit ive  t , s , x . 

Both (B1) and (B2) follow from the strong mixing condition (1.3).

Define the operator $A_t$ by $A_t k(u) = -k_u'(u)K_0 u + A_t^0 k(u) = -k_u'(u)K_0 u + \frac{1}{2}\sum_{i,j} p_{ij}(t)\,\partial^2 k(u)/\partial u_i \partial u_j$. Set $\{p_{ij}\} = P = QQ'$.


Theorem 5.  Under (B1)-(B3), $\{u^\varepsilon(\cdot)\}$ converges weakly in $D^r[0,\infty)$ to the diffusion (2.5) with generator $A_t$ and initial condition $u(0) = 0$.


The proof follows the lines of that of Theorem 3 quite closely, except, of course, that the integrals cannot necessarily be written as sums. Also, due to the scaling and to the fact that $y(\cdot)$ is bounded, tightness is more easily proved than in Section 5. In particular, $k_i^\varepsilon \to 0$ as $\varepsilon \to 0$ ($i = 1,2$) and so does $(k_{2,u}^\varepsilon)'\dot u^\varepsilon$, all uniformly on $[0,\infty)$. Theorem 5 is also a consequence of the results in [4] (under their mixing condition (3.1) on $y(\cdot)$, which implies (B1)-(B2)), [6] (for Markovian $y(\cdot)$) or [7]. We will give only the relevant $k_i^\varepsilon$.

Let $k \in \hat C^3$. Then $k_i^\varepsilon(t) = k_i^\varepsilon(u_t^\varepsilon,t)$, where we use
f®  I"  6M  "I 

kitu>t)  ■ ^ J0Bt\c">K.su  * 5tJjs 


k2(u,t) 

We  have  Aeke  = 
as  e -►  0 . 


e (6MfS*v>’ 


6M 


kuu^ 


4^  w-dv  - 


Atk  plus  terms  which  to  to  zero  (uniformly  in 


A^k(u) 

w,t) 


An extension of Theorem 5. Theorem 5 is not very satisfactory, since we are usually interested in $u^\varepsilon(\cdot)$ for large times, and would like some estimates of how close the tails ($[T,\infty)$ sections, for large $T$) are to the tail of (2.5). The weak convergence result of Theorem 5 does not give this to us. In Theorem 6, we show that

(6.1)  $\sup_{t \ge 0,\ \varepsilon\ \mathrm{small}} E|u^\varepsilon(t)|^2$ is bounded,

(6.2)  $\{u^\varepsilon(T+\cdot),\ T > 0,\ \varepsilon\ \mathrm{small}\}$ is tight in $D^r[0,\infty)$.

From these, we get the more satisfactory conclusion of Theorem 6. As noted after the proof, the method is not sharp enough to get good estimates for how small $\varepsilon$ must be in order for (6.1)-(6.2) to hold.

Theorem 6.  Assume (B1)-(B3). Then (6.1) and (6.2) hold and as $\varepsilon \to 0$, $T \to \infty$ in any way at all, $u^\varepsilon(T+\cdot)$ converges weakly to the stationary solution to (2.5). Also, if $u^\varepsilon(T+\cdot)$ converges weakly to a process $\bar u^\varepsilon(\cdot)$ as $T \to \infty$, then $\bar u^\varepsilon(\cdot)$ converges weakly to the stationary solution of (2.5). (Indeed, it can be easily shown from (6.1) and the stationarity of $y(\cdot)$ that $\{u^\varepsilon(T+\cdot), T > 0\}$ is tight for each small $\varepsilon$.)

Proof:  Only a sketch will be given. The stability idea is essentially that in [6], except that it is used in a slightly different way and that Kurtz's results must be used since $y(\cdot)$ is not assumed to be Markovian. Let $P$ be a positive definite symmetric matrix such that $u'Pu = k(u)$ is a Liapunov function for $\dot u = -K_0 u$, and $C = -(K_0'P + PK_0)$ is negative definite. In the proofs of Theorems 3, 4, 5, it was required that $k(\cdot) \in \hat C^3$. Our $k(\cdot)$ here is not in $\hat C^3$, but it makes little difference in the proofs. This is because $y(\cdot)$ is bounded and because we could use the form $\int_t^T E_t^\varepsilon(\cdot)$ for the $k_i^\varepsilon$ and get estimates (6.1) for $t \le T$, and which do not depend on $T$. In order to simplify the argument, we ignore the fact that $k(\cdot) \notin \hat C^3$.
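The role of the quadratic Liapunov function can be made explicit by a short computation. The following is a sketch for the noise-free linear system only, assuming $P$ is symmetric. It is standard linear-systems algebra rather than a formula quoted from the paper.

```latex
\begin{align*}
\frac{d}{dt}\,k(u_t)
  &= \dot u_t' P u_t + u_t' P \dot u_t
     && \text{since } k(u) = u'Pu \\
  &= -u_t' K_0' P u_t - u_t' P K_0 u_t
     && \text{since } \dot u = -K_0 u \\
  &= -u_t'\,(K_0' P + P K_0)\,u_t .
\end{align*}
```

So $k$ decreases strictly along every nonzero trajectory exactly when the symmetric matrix $K_0'P + PK_0$ is positive definite, which is the quadratic form entering the definition of $C$.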

We  have  (use  u ■ ufc>  w = 6M  = 6Mt) 

Aek(u)  = -u’Cu  - |u'[(6Me)’P  + P(6Me)]u  - \ (u'P  ♦ w'(^-)'Pu]. 

Following the method of Theorem 4, define $k_1^\varepsilon$ such that $A^\varepsilon k_1^\varepsilon$ cancels all the terms of $A^\varepsilon k(u)$, except for the first. Set $k_1^\varepsilon(t) = k_1^\varepsilon(u_t^\varepsilon,t)$
where

(6.3)  kE(u,t)  - 4 j^Etu'1  (6^t>s),p  + P6^t+s  u,ds 

- ^ j^Etlu'P5l*t*s"  4 w'(6ME+s)’Pu)ds. 


Note  that,  by  the  change  of  variable  s/t  s, 

2 r00 

(6.4)  k , (u , t)  = F-V  [6M(s+t/£2)  'P  + P6M(s+t/£2) ] u ds 

1 Jo1 

#oo  _ 

- Ej[u'P6M(s+t/cZ)w  + w' 6M(s+t/£Z) 'Pu]ds. 

T n t 


Now , 


(6.5)  Aekf(u,t)  = -(last  two  terms  of  ACk(u))  + (k.E  (u,t))'u* 

J.  1 y U 

The function $k_2^\varepsilon(t)$ is chosen in a way such that $A^\varepsilon k_2^\varepsilon(t)$ cancels the part of the last term of (6.5) which is not $O(\varepsilon)$ (to see which terms are $O(\varepsilon)$ or $O(1)$, change variables $s/\varepsilon^2 \to s$). Thus set


2 #«>  .00 

.6)  kE(t)  = ds  (eJw- 

c Tze^  J 0 1 0 z 


(6Mt£  + s)'P6<  + s + vwt  ' Ctldv 


where 


r°o 

Ct  = (2K2/!2)  j Ew'ffiMjP'PdMg.^ds. 


Now, 


A k2(t)  = -(dominant  part  of  last  term  in  (6.5)  + . 

By a change of variables $s/\varepsilon^2 \to s$, $v/\varepsilon^2 \to v$ and (B2), it can be verified that $k_2^\varepsilon = O(\varepsilon^2)$.

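Combining the foregoing together with (B1)-(B2) yields, for $k^\varepsilon = k + k_1^\varepsilon + k_2^\varepsilon$, $u = u_t^\varepsilon$,

$$A^\varepsilon k^\varepsilon(t) = -u'Cu + C_t + \varepsilon B^\varepsilon(u,w_t),$$
$$k(u) \le k^\varepsilon(u,t) + \varepsilon|\tilde B^\varepsilon(u,w_t)|,$$

where $B^\varepsilon(\cdot)$ and $\tilde B^\varepsilon(\cdot)$ are both the sum of quadratic forms in $u$ and bilinear forms in $(u,w_t)$ and with coefficients that are bounded uniformly in $\varepsilon, \omega, t$.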

By  (3.1)  and  the  above  estimates,  there  are  positive  real  C,& 
and  Eg  such  that  for  £ < e^, 


I 


(6.7)  Kk(uJ)  5 likc(Up) 


•i: 


I k (u^)dv  + 


C ilv  + e|  Bc(u^,ws)  | 


Since $u_0 = 0$, (6.7) implies (6.1).

Given (6.1), it is not hard to show, via the method of Section 4 (see also Section 5 of [7]), that (6.2) holds. Also, the limit of any weakly convergent sequence (as $\varepsilon \to 0$) must be some solution of (2.5).


We only need to show that as $\varepsilon \to 0$, $T \to \infty$, any subsequence converges to the stationary solution of (2.5). Fix $T_1 > 0$ and take a convergent subsequence of $\{u^\varepsilon(T+\cdot)\}$ ($\varepsilon \to 0$, $T \to \infty$). Take a further subsequence of the subsequence, such that the $[T-T_1,\infty)$ sections are weakly convergent also. Let $u(\cdot)$ and $\tilde u(\cdot)$ denote the weak limits on $D^r[0,\infty)$ of the $[T,\infty)$ and $[T-T_1,\infty)$ sections, resp. Then $u(\cdot)$ is just the $[T_1,\infty)$ section of $\tilde u(\cdot)$. In particular, $u(0) = \tilde u(T_1)$. Since $\{u^\varepsilon(t)\}$ is tight and $T_1$ arbitrary and $\dot u = -K_0 u$ asymptotically stable, we get that any limit as $T \to \infty$, $\varepsilon \to 0$ must be the stationary solution of (2.5). Q.E.D.


Remark. Theorem 6 is preferable to Theorem 5, but since $\varepsilon_0$ depends on the maximum magnitude of $y(\cdot)$, we do not get a good estimate of the stability region. Some other approach seems to be needed for this. We have tried to combine the above ideas with the ideas in stochastic stability for linear systems with coefficient variations (such as those based on Gronwall's Lemma [15]-[17]) but without much success so far.